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Abstract 

In practice, the information regarding the population proportion possessing certain 
attribute is easily available see Jhajj et.al. (2006). For estimating the population mean Y 
of the study variable y, following Bahl and Tuteja (1991), a ratio-product type 
exponential estimator has been proposed by using the known information of population 
proportion possessing an attribute (highly correlated with y) in simple random sampling. 
The expressions for the bias and the mean-squared error (MSE) of the estimator and its 
minimum value have been obtained. The proposed estimator has an improvement over 
mean per unit estimator, ratio and product type exponential estimators as well as Naik 
and Gupta (1996) estimators. The results have also been extended to the case of two 
phase sampling. The results obtained have been illustrated numerically by taking some 
empirical populations considered in the literature. 
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1. Introduction 



In survey sampling, the use of auxiliary information can increase the precision of 
an estimator when study variable y is highly correlated with the auxiliary variable x. but 
in several practical situations, instead of existence of auxiliary variables there exists some 
auxiliary attributes, which are highly correlated with study variable y, such as 
(i) Amount of milk produced and a particular breed of cow. (ii) Yield of wheat crop and 
a particular variety of wheat etc. (see Shabbir and Gupta(2006)). 

In such situations, taking the advantage of point biserial correlation between the 
study variable and the auxiliary attribute, the estimators of parameters of interest can be 
constructed by using prior knowledge of the parameters of auxiliary attribute. 

Consider a sample of size n drawn by simple random sampling without 
replacement (SRSWOR) from a population of size N. let y ; and (j), denote the 
observations on variable y and (j) respectively for the i th unit (i = 1,2,..., N). We note that 
(();= 1, if i th unit of population possesses attribute (j) and cj); = 0, otherwise. Let 

N n 

A = ^(j) ; and a = ^ <j> , denote the total number of units in the population and sample 

i=l i=l 

Al. Cl 

respectively possessing attribute^ . Let P = — and p = — denote the proportion of units 

N n 

in the population and sample respectively possessing attribute (j) . 

In order to have an estimate of the population meanY of the study variable y, 
assuming the knowledge of the population proportion P, Naik and Gupta (1996) defined 
ratio and product estimators of population when the prior information of population 
proportion of units, possessing the same attribute is available. Naik and Gupta (1996) 
proposed following estimators: 
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- p 

tj = y — 

Ip 



= y - 



The MSE of tj and t 2 up to the first order of approximation are 



MSE(tj) = f,Y 2 [Cy +Cp(l- 2K p )] 
MSE(t 2 ) = f,Y 2 [Cy +Cp(l + 2K p )] 



, S 2 , S 2 11 

whereC 2 = ^,C 2 = -*-,f, = 

y Y 2 p P 2 1 n N 






^=^Z(t-^.S rt =^_Z yA -NPY 



p b = — - — is the point biserial correlation coefficient. 

S y S^) 

Following Bahl and Tuteja (1991), we propose the following ratio and product 
exponential estimators 



t 3 = yexp 



P-P 
P + p 



t 4 = yexp 



P-P 
p + P 



2 . Bias and MSE of t3 and t4 

To obtain the bias and MSE of t 2 to the first degree of approximation, we define 
e y = ^ , e + = , therefore E (e ; ) = 0. i = (y, cj)) , 



E(e 2 ) = f,C 2 , E(e 2 ) = f,C 2 ,E(e v e , ) = f lPpb C v C p . 
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Expressing (1.5) in terms of e’s, we have 



*3 



Y(l + e y )exp 



P~P(l + e t ) 
P + P(l + e^) 



= Y(l + e y )exp 



(2 + e,) 



( 2 . 1 ) 



Expanding the right hand side of (2.1) and retaining terms up to second powers of e’s, we 
have 



*3 



= Y 



1 + e. 



e, eT e e, 

<l> _ y <t> 

2 8 2 



(2.2) 



Taking expectations of both sides of (2.2) and then subtracting Y from both sides, we get 
the bias of the estimator t 3 up to the first order of approximation, as 






— -K 
4 p 



(2.3) 



From (2.2), we have 



(t, - y)= Y 




(2.4) 



Squaring both sides of (2.4) and then taking expectations we get MSE of the estimator t 3 , 
up to the first order of approximation as 



MSE(t,) = f 1 Y 2 





(2.5) 



To obtain the bias and MSE of t 4 to the first degree of approximation, we express (1.6) in 
terms of e’s 
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MSE(t 4 ) = fjY 2 Cy +Cp(-^- + K p ) (2.8) 

3. Proposed class of estimators 

It has been theoretically established that, in general, the linear regression 
estimator is more efficient than the ratio (product) estimator except when the regression 
line of y on x passes through the neighborhood of the origin, in which case the 
efficiencies of these estimators are almost equal. Also in many practical situations the 
regression line does not pass through the neighborhood of the origin. In these situations, 
the ratio estimator does not perform as good as the linear regression estimator. The ratio 
estimator does not perform well as the linear regression estimator does. 

Following Singh and Espejo (2003), we propose following class of ratio-product 
type exponential estimators: 

f p_ x f -pV 

t 5 =y a exp — +(l-a)exp — (3.1) 

l p +p J Ip+ p J_ 

where a is a real constant to be determined such that the MSE of ts is minimum. 



For a=l, t 5 reduces to the estimator t 3 = yexp 



p -p " 
P + P> 



and fora= 0, it reduces to 



t 4 =yexp 



P-P " 

P + P/ 
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Bias and MSE of ts: 



Expressing (3.1) in terms of e’s, we have 



, { [P-P(l + eJ] [P(l + e + )-P 

1 P + Pl + eJ P(l + eJ + P 



= Y(l + e y j aexp< > + (l-a)exp< -y- 



Expanding the right hand side of (3.2) and retaining terms up to second powers of e’s, we 



t 5= Y 1 + e y + — +v + e y e + ~ ae y e 4 

Z o 



Taking expectations of both sides of (3.3) and then subtracting Y from both sides, we get 
the bias of the estimator t 5 up to the first order of approximation, as 



B(t s ) = f,Y % + p pb C,C fi-a 

o Z 



From (3.3), we have 



k-Y)=Ye ,+eft 



Squaring both sides of (3.5) and then taking expectations we get MSE of the estimator t 5 , 
up to the first order of approximation as 



MSE(t 5 ) = f[Y 2 c;+C;|i + a ! -a] + 2p pl C 1 ,cJi-a 

V 4 7 V 2 



Minimization of (3.6) with respect to a yields optimum value of as 



2K + 1 

a = — - — = a 0 (Say) 
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Substitution of (3.7) in (3.1) yields the optimum estimator for t 5 as (ts) 0 pt (say)with 
minimum MSE as 

min .MSE(t 5 ) = f 1 Y 2 C;(l- P ; b )=M(t 5 ) opt (3.8) 

which is same as that of traditional linear regression estimator. 

4. Efficiency comparisons 

In this section, the conditions for which the proposed estimator ts is better than y , ti, 
t 2 , t 3 , and t 4 have been obtained. The variance of y is given by 

var(y) = f,Y 2 C 2 (4.1) 

To compare the efficiency of the proposed estimator ts with the existing estimator, from 
(4.1) and (1.3), (1.4), (2.5), (2.8) and (3.8), we have 



var(y) - M(t 5 ) 0 = p 2 b > 0. (4.2) 

MSE(t,)-M(t s )„ =(C„ -p p ,C,) 2 >0. (4.3) 

MSE(t 2 )-M(t 5 ) 0 =(c p + Ppb C y ) 2 >0. (4.4) 

f C 2 y 

MSE(tj) -M(t 5 ) 0 = -^- Ppb C y >0. (4.5) 

V J 

(c 2 Y 

MSE(t 4 )-M(t 5 ) 0 = + p pb C y >0. (4.6) 

V 2 ) 



Using (4.2)-(4.6), we conclude that the proposed estimator t 5 outperforms y , ti, t 2 , t 3 , and 

U. 

5. Empirical study 

We now compare the performance of various estimators considered here using the 
following data sets: 
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Population 1. [Source: Sukhatme and Sukhatme (1970), p. 256] 

y = number of villages in the circles and 

cj) = A circle consisting more than five villages. 

N = 89, Y =3.360, P = 0.1236, p pb = 0.766, C y = 0.60400, C p = 2.19012. 

Population 2. [Source: Mukhopadhyaya, (2000), p. 44] 

Y= Household size and 

cj) = A household that availed an agricultural loan from a bank. 

N = 25, Y =9.44, P = 0.400, p pb = -0.387, C y = 0.17028, C p = 1.27478. 

The percent relative efficiency (PRE’s) of the estimators y , ti-t 4 and (t 5 ) op t with respect to 
unusual unbiased estimator y have been computed and compiled in table 5.1. 

Table 5.1: PRE of various estimators with respect to y . 



Estimator 


PRE’s (., y ) 




Population 




I 


II 


y 


100 


100 


ti 


11.63 


1.59 


t2 


5.07 


1.94 


t3 


66.24 


5.57 


U 


14.15 


8.24 


(ts)o 


241.98 


117.61 
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Table 5.1 shows that the proposed estimator t 5 under optimum condition performs 
better than the usual sample mean y, Naik and Gupta (1996) estimators (ti and ( 2 ) and 
the ratio and product type exponential estimators (t 3 and t 4 ). 

6. Double sampling 

In some practical situations when P is not known apriori, the technique of two-phase 
sampling is used. Let p’ denote the proportion of units possessing attribute^ in the first 
phase sample of size n’; p denote the proportion of units possessing attribute^ in the 
second phase sample of size n < n’ and y denote the mean of the study variable y in the 
second phase sample. 

When P is not known, two-phase ratio and product type exponential estimator are 
given by 



t 6 = yexp 



'p-p' 



p+pj 



t 7 = yexp 






P~P 
P + P' 



To obtain the bias and MSE of te and t 7 , we write 
y = Y(l + e y ), p = P(l + e 4 ), p'=P(l + e'J 
such that 

E(e y )=E(eJ= E(e\)=0. 



( 6 . 1 ) 

(6.2) 



and 



E(e;)=f,c;, E( e ;)=f,C=, E(e',) 2 =f 2 C=, E(e,e’J = f 2 p pl C y C„ . 
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1 1 



where E = 



n' N 



Expressing (6.1) in terms of e’s, we have 



t 6 = Y(l + e v )exp 



P(l + e',)-P(l + e,) 
P(l + e\) + P(l + eJ 



= Y(l + e v )exp 



e *-e. 



(6.3) 



Expanding the right hand side of (6.3) and retaining terms up to second powers of e’s, we 
have 



t«=Y 



e\ e, e'l e 2 e,e', e e\ e e, 
^_l_ e 4> _<!>_ _| <l> ! <t> <l> j> ! y <l> y 4 

y 2 2 8 8 4 2 2 



(6.4) 



Taking expectations of both sides of (6.4) and then subtracting Y from both sides, we get 
the bias of the estimator t 6 up to the first order of approximation, as 



C 2 

B(0 = f,Y^(l-2K p ) 



(6.5) 



From (6.4), we have 



k-Y)=Y 



(e'.-ej 



6 ’ + ' 2 



( 6 . 6 ) 



Squaring both sides of (6.6) and then taking expectations we get MSE of the estimator t 6 , 
up to the first order of approximation as 



MSE(t 6 ) = Y : 



f.Cj +f 3 ^Y(l-4K p ) 



(6.7) 



To obtain the bias and MSE of t 7 to the first degree of approximation, we express (6.2) in 
terms of e’s as 
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t 7 = Y(l + e )exp 



P(l + e,)-P(l + e;) 
Pd + ej + Pd + eV) 



= Y(l + e )exp 



e, - e. 



Expanding the right hand side of (6.8) and retaining terms up to second powers of e’s, we 



e. e\ e'( e( e A e', e e, e e' 
<l> «t> . <► . «t> <t> + . y <► y ' 



t 7 = Y 1 + e + -*- ^ + ^ + ^ + 

7 y 2 2 8 8 4 2 2 



Taking expectations of both sides of (6.9) and then subtracting Y from both sides, we get 
the bias of the estimator t 7 up to the first order of approximation, as 



B(t 7 ) = f,Y < k(l + 2K [ ) 



(6.10) 



From (6.9), we have 



(t, - y)= Y e + 



(e* -e'J 



(6.11) 



Squaring both sides of (6.11) and then taking expectations we get MSE of the estimator 
t 7 , up to the first order of approximation as 



MSE(t 7 ) = Y 2 f t Cy + f 3 (l + 4K p ) 



( 6 . 12 ) 



7. Proposed class of estimators in double sampling 

We propose the following class of estimators in double sampling 



p-p . p - p 

t g = y oqexp +(l-a,)exp 

Ip'+pJ Ip + p 1 



where is a real constant to be determined such that the MSE of t 8 is minimum. 
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For a,=l, t 8 reduces to the estimator t 6 =yexp 



'pV 

vP'+Py 



and fora,= 0, it reduces to 



t 7 = yexp 



W) 

p+p'y 



Bias and MSE of t 8 : 

Expressing (7.1) in terms of e’s, we have 

P(l + e'^ ) -P(l + ecj)) ] 
P(1 + e'* ) + P(l + e(|)) 



t 8 = Y ( 1 + e y J exp<j 



+ (l-a 1 )exp- 



= Y ( 1 + e yj a i ex p] r + (! - a i) ex p] -^r— 1 



P(l + e+)-P(l + e' 4 ) 
P(l + e + ) + P(l + e' + ) 

(7.2) 



Expanding the right hand side of (7.2) and retaining terms up to second powers of e’s, we 
have 



e A e A el e\ 2 e y e^ e y e', 



t 8 = Y[l + e y + y- Y-a ie+ +a ie ; + y + ^ + 



8 8 2 2 



e A e . 



'+ a i e y e '(|,- a i e y e <|,] 



(7.3) 



Taking expectations of both sides of (7.3) and then subtracting Y from both sides, we get 
the bias of the estimator t 8 up to the first order of approximation, as 



_ C ; 

B(t 8 ) = f 3 Y-7f 
From (7.3), we have 
(t 8 - Y) = Y| 



1-8K, 



a. — 

V 2y 



(7.4) 





r n 






n 




e v - 


a, — 


e , + 


a. 


— 


e ’ 


y 


V ' 2 J 


9 


V 1 


2; 


9 



(7.5) 
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Squaring both sides of (7.5) and then taking expectations we get MSE of the estimator t 8 , 
up to the first order of approximation as 

MSE(t 8 ) = Y 2 

Minimization of (7.6) with respect to otj yields optimum value of as 



f C +f C 



a. — 
1 2 



a. — 

. 2y 



-2K, 



(7.6) 






2K p +l 

2 



a i 0 (Say) 



(7.7) 



Substitution of (7.7) in (7.1) yields the optimum estimator for t 8 as (ts) op t (say) with 
minimum MSE as 

min .MSE(t g ) = Y 2 C 2 y (f t -f 3 p 2 b )=M(t 8 ) 0 , (say) (7.8) 

which is same as that of traditional linear regression estimator. 

8. Efficiency comparisons 

The MSE of usual two-phase ratio and product estimator is given by 

MSE(t 9 ) = Y 2 [f,C 2 +f 3 C 2 (l-2K p )J (8.1) 

MSE(t 10 ) = Y 2 [f,C; + f 3 C p (l + 2K p )] (8.2) 

From (4.1), (6.7), (6.12), (8.1), (8.2) and (7.8) we have 



var(y)-M(t g ) 0 =f 3 p pb >0. 



MSE(t 6 )-M(t 8 ) 0 

MSE(t 7 ) -M(t g ) 0 
MSE(t 9 ) -M(t g ) 0 




PpbCy 

7 



> 0 . 



= f. 



c „ 



> 0 . 




(8.3) 

(8.4) 

(8.5) 

( 8 . 6 ) 
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MSE(t 10 )-M(t g ) 0 = f 3 (C p + Ppb C y ) 2 >0. 



(8.7) 



From (8.3)-(8.7), we conclude that our proposed estimator t 8 is better than y , t 6 , t 7 , t 9 , and 
tio- 

9. Empirical study 

The various results obtained in the previous section are now examined with the 
help of following data: 

Population 1. [Source: Sukhatme and Sukhatme( 1970), p. 256] 

N = 89, n' = 45, n = 23, y = 1322, p = 0.1304, p'= 0.1333, p ph = 0.408, C y = 0.69144, 

C p = 2.7005. 

Population 2. [Source: Mukhopadhyaya( 2000), p. 44] 

N = 25, n' = 13, n = 7, y = 7.143, p = 0.294, p' = 0.308, p pb = -0.314, C y = 0.36442, 
Cp=l. 34701. 

Table 9.1: PRE of various estimators (double sampling) with respect to y . 



Estimator 


PRE’s (., y) 




Population 




I 


II 


y 


100 


100 


h 


40.59 


25.42 


h 


21.90 


40.89 


t9 


11.16 


8.89 


tio 


7.60 


12.09 


(t8)o 


112.32 


106.74 
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Table 9.1 shows that the proposed estimator t 8 under optimum condition performs better 

than the usual sample mean y , t 6 , t 7 , t 9 , and ti 0 . 
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